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ABSTRACT 

Many chloroplast transcripts are protected 
against exonucleolytic degradation by RNA-binding 
proteins. Such interactions can lead to the accumu- 
lation of short RNAs (sRNAs) that represent foot- 
prints of the protein partner. By mining existing 
data sets of Chlamydomonas reinhardtii small 
RNAs, we identify chloroplast sRNAs. Two of these 
correspond to the 5 -ends of the mature psbB and 
psbH messenger RNAs (mRNAs), which are both 
stabilized by the nucleus-encoded protein Mbb1, a 
member of the tetratricopeptide repeat family. 
Accordingly, we find that the two sRNAs are 
absent from the mbb1 mutant. Using chloroplast 
transformation and site-directed mutagenesis to 
survey the psbB 5' UTR, we identify a c/s-acting 
element that is essential for mRNA accumulation. 
This sequence is also found in the 3 UTR of psbH, 
where it plays a role in RNA processing. The two 
sRNAs are centered on these c/s-acting elements. 
Furthermore, RNA binding assays in vitro show that 
Mbb1 associates with the two elements specifically. 
Taken together, our data identify a conserved c/s- 
acting element at the extremity of the psbH and 
psbB 5 UTRs that plays a role in the processing 
and stability of the respective mRNAs through inter- 
actions with the tetratricopeptide repeat protein 
Mbb1 and leads to the accumulation of protected 
sRNAs. 



INTRODUCTION 

In the chloroplast, post-transcriptional steps play a major 
role in the control of gene expression. Many chloroplast 
genes are part of polycistronic transcription units, and 
RNA maturation is complex. It involves endonucleolytic 
and exonucleolytic processing at the 5'-end, the 3'-end and 
in intergenic spacers, intron splicing and in plants, RNA 
editing (1-4). These maturation events in turn influence 
messenger RNA (mRNA) translation (5-7). RNA matur- 
ation, RNA stability and translation are regulated by de- 
velopmental programs and by environmental factors such 
as light or nutrient availability (6,8,9). Numerous nucleus- 
encoded factors are imported in the chloroplast where 
they govern these post-transcriptional events (4,10). 
Most of these factors are highly specific and generally 
target only one or a few genes. A prominent example for 
such RNA-binding proteins is the members of the helical- 
repeat protein super-family, which fulfill various tasks for 
the maturation of organellar RNAs and include 
pentatricopeptide repeat (PPR), octotricopeptide repeat 
(OPR) or TPR/HAT (tetratricopeptide repeat/half a 
tetratricopeptide repeat) proteins (11-16). The prototyp- 
ical example of helical-repeat proteins is Pumilio, where 
each repeat is composed of three alpha-helices that 
interact to provide a super-helical scaffold. Each repeat 
presents specific amino acid residues that bind to 1 nt of 
the RNA substrate (17). The OPR family has expanded 
during the evolution of Chlamydomonas reinhardtii, 
whereas it is the PPR family that is most prominent in 
the higher-plant lineages (12-14). Members of these 
helical-repeat protein families can protect chloroplast 
RNAs against exonucleolytic degradation by tightly 
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binding sequences in the UTRs and thus increase the sta- 
bility of their substrates (18-24). A well-studied example is 
the binding of PPR10 to its target sequences, which 
impedes the progression of both 5'- and S'-exonucleases 
and can thus protect either the downstream or the 
upstream RNA, respectively (22,23). Intriguingly, PPR10 
as well as other helical repeat proteins generate short, 1 5- 
30-nt-long sRNAs simply by protecting the bound RNA 
segment, i.e. the footprint, against exonucleolytic degrad- 
ation (22-24). Almost 100 such sRNAs are found in plant 
chloroplast transcriptomes, many corresponding to ends 
of transcripts and to known or presumed protein binding 
sites (25-27). Consequently, sRNAs can be used as a 
proxy to identify binding sites of RNA-binding proteins. 
To date, sRNA data sets have been presented for different 
species of angiosperms, namely, barley, maize, Arabidopsis 
and Chinese cabbage (25,26,28). Whether sRNAs are 
present in chloroplasts of other lineages is at present 
unclear, although this is suggested by the wide evolution- 
ary distribution of chloroplast-targeted helical-repeat 
proteins. 

Here, we identify chloroplast sRNAs of the green alga 
C. reinhardtii in public data sets from high-throughput 
RNA- sequencing experiments. We show that some 
sRNAs co-localize with transcript ends and can be 
detected by RNA gel blot analysis. To investigate their 
biological significance, we focus on two sRNAs that 
map to the psbB/psbT/psbH gene cluster, which is 
transcribed as a unit and processed to give rise to the 
monocistronic psbB and dicistronic psbB/T mRNAs, as 
well as to several forms of psbH RNA, all of which 
encode subunits of PSII (29,30). The nucleus-encoded 
factor Mbbl is specifically required for the stable accumu- 
lation of all the transcripts from this cluster (29,31). The 
analysis of reporter constructs has shown that Mbbl acts 
through the 5' untranslated region (5' UTR) of psbB, and 
there is genetic evidence that it also acts directly on 
psbH (19). 

Mbbl is one of the rare RNA-binding proteins in 
chloroplasts for which an ortholog can be identified in 
higher plants. This ortholog, named HCF107 (high 
chlorophyll fluorescence 107), is required for expression 
of psbB and psbH (32,33). In the hcfl07 mutant, RNA 
processing upstream of psbH is deficient and its transla- 
tion is impaired. Translation of psbB is also defective in 
this mutant (32), even though the pattern of psbB tran- 
scripts appears normal. In vitro assays have demonstrated 
that recombinant HCF107 binds the 5'-end of the psbH 
transcript and can protect it against exonucleolytic deg- 
radation from either the 5' or the 3' side (24). An sRNA 
representing the footprint of HCF107 is detected in vivo. 
Thus, the TPR/HAT protein HCF107 seems to act simi- 
larly to PPR proteins like PPR10, in line with their similar 
predicted helical-repeat structure (34). 

Here we show that the sRNAs mapping to the ends of 
the 5' UTRs of psbH and psbB are missing in the mbbl 
mutant, suggestive of a direct functional link between 
Mbbl and these short RNA segments. Using chloroplast 
site-directed mutagenesis, we demonstrate the importance 
of the corresponding sequence elements for mRNA stabil- 
ity in vivo by a systematic genetic survey of the entire psbB 



5' UTR and of conserved sequences in the psbH 5' UTR. 
Association of Mbbl with this RNA sequence element is 
demonstrated by in vitro binding assays. 

MATERIALS AND METHODS 

Strains and media 

The C. reinhardtii mbbl-222E mutant strain was described 
previously (29). For phenotypic analysis of mutant strains 
(spot-tests), 2 ml of culture was grown overnight in Tris 
acetate phosphate medium in the dark, and then 15ul 
aliquots were spotted on agar plates containing Tris 
acetate phosphate or high salt minimum (35). Biolistic 
transformation of C. reinhardtii and selection on spectino- 
mycin were described previously (19). 

RNA and protein analysis 

RNA was extracted using TriReagent (Sigma-Aldrich) 
and analyzed by agarose gel electrophoresis, capillary 
transfer to nylon membranes and hybridization with 
32 P-labeled probes as described previously (36). The 
psbB probe was a 1.1 -kb Ncol — EcoRI fragment of 
p38ANco. The psaA exon 3 probe was a 2.2-kb 
EcoRI— Hindlll fragment of R17EH4 (37). The psbH 
probe was a 250-bp polymerase chain reaction (PCR) 
fragment amplified with the two oligonucleotides 5' 
PSBH and 3' PSBH. The 5' psbH probe was a 309-bp 
Pad — XmnI fragment, and the 3' psbH probe was a 
484-bp Aatll — EcoRI fragment. Probes were stripped in 
0.5% sodium dodecyl sulphate at 60°C for 1 h, and the 
membranes were checked for residual signal using a 
phosphor screen. The autoradiographs shown in Figure 
1 were all obtained using one membrane that was repeat- 
edly stripped and reprobed. 

Primer extension mapping of the psbB and psbH 5 r 
UTR was performed with 10|ig of RNA, 32 P-labeled 
oligonucleotides and M-MLV reverse transcriptase 
(Promega) at 50° C essentially as described (38). 

Chlamydomonas sRNA data were retrieved from 
Ibrahim et al. (39), available from the National Center 
for Biotechnology Information as GEO Series 
GSE17815, including reads from four different libraries: 
two libraries had been prepared from a knockout line of 
the terminal nucleotidyltransferase and two libraries from 
the knockdown of a tryptophan synthase beta subunit. 
Three further libraries from the Comparative Sequencing 
of Plant Small RNAs project are available as GEO Series 
GSE32457. They had been prepared from a wild-type 
(WT) Chlamydomonas strain grown under normal condi- 
tions, under phosphate starvation or under sulfate starva- 
tion. sRNA sequences represented by > 1 read per sample 
were mapped to the chloroplast genome (NC_005353) 
using CLC Genomics Workbench (version 6.0.1) 
allowing one mismatch in the core sequence. sRNAs 
with >15 reads were extracted. 

sRNAs were enriched according to (40) from total RNA 
prepared as described by (41). RNA was resolved on 15% 
polyacrylamide gels containing 8-M urea and transferred 
to Hybond-N nylon membranes (GE Healthcare). 
Membrane-bound RNAs were chemically cross-linked as 
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Figure 1. sRNAs correspond to the 5'-ends of chloroplast mRNAs in 
Chlamydomonas. (A) The sequences corresponding to four chloroplast 
sRNAs (Supplementary Table SI) are highlighted in bold. The position 
of the 5'-ends as mapped by RACE is shown with arrowheads. The 
numbers above the arrowheads indicate the number of 5'-RACE clones 
that end at the respective sites. Most clone ends coincide with sRNA 5'- 
ends. (B) Sequence alignment of the psbB 5' and psbH 5' sRNAs. 
Asterisks indicate conserved positions. (C) Autoradiograms of an 
RNA blot of chloroplast RNA from the WT and the mbbl mutant 
hybridized with the radioactive probes indicated above the panels. 
The arrows indicate the positions of the sRNAs. Low molecular 
weight RNA was enriched from total RNA and separated on a 15% 
sodium dodecyl sulphate-polyacrylamide gel electrophoresis containing 
8M urea. A fluorescence image of the gel lanes stained with ethidium 
bromide is shown as loading control in the leftmost panel. The probes 
were single-stranded DNA oligonucleotides (<25nt) antisense to the 
sRNAs. The same membrane was repeatedly probed and stripped as 
described in Materials and Methods section. 



described by (42). DNA oligonucleotides were end-labeled 
using polynucleotide kinase (Thermo Scientific) and 
y 32 P-ATP (Hartmann- Analytics) and used as probes. 

The 5' and 3' RACE experiments were carried out as 
described (25), with primers described in Supplementary 
Table SI. 

Protein samples were prepared and analyzed by 
immunoblotting as described previously (36). 

Site-directed mutagenesis 

The mutations were introduced in a transformation vector 
[p38ANco; (19)] containing an aadA cassette, which 
confers spectinomycin resistance as a selectable marker 
for chloroplast transformation (43). The marker was 
inserted at position —320 and was transcribed in the 



opposite direction relative to psbB/T/H (Supplementary 
Figure SI). In this vector, the two nucleotides preceding 
the ATG start codon were changed to create an Ncol 
restriction site; this change did not measurably affect 
psbB/T/H mRNA expression but reduced the translation 
of the PSII subunits and the growth rate of the strains 
[Supplementary Figure S2; (19)]. Restriction sites used 
for these constructs are shown in Supplementary 
Figure SI. The mutant psbB 5 r UTR fragments (m3-8 to 
m32-37) were generated by PCR with the sense primer 
StuNde and the respective antisense primer (Apa-3 to 
Apa-32). The PCR products were cloned in the 
Smal site of pBluescriptKS(+), from which they were 
excised with Ndel and Ncol. For m37-42 and m43-48, 
the WT 325-bp Ndel — Ncol fragment inserted in 
pBluescriptKS(+) was used as a template for PCR 
around the entire plasmid with 5' phosphorylated oligo- 
nucleotides containing half of the Apal at their Spends: 
s-37>-15 with as-40>-69, or s-45>-15 with as-45>-74, re- 
spectively. The PCR product was ligated and cloned, and 
the mutant UTR fragments were excised with Ndel and 
Ncol. The sequences of the mutant fragments were 
verified by sequencing. The Ndel — Ncol fragments were 
inserted in the vector P38.NcoA0 (19) digested with the 
same enzymes. Finally, the spectinomycin resistance 
cassette from pUCatpXaadA (43) was inserted as a 
1.9-kb EcoRV Smal fragment into the StuI site to 
obtain the mutant transformation vectors. 

The psbH transformation vector p41 A was prepared as 
follows. A 3216-bp BamHI — Seal genomic fragment was 
cloned in the pBluescriptKS(+) vector digested with 
EcoRV and BamHI (plasmid p41). Then the atpXaadA 
cassette was inserted as a 1.9-kb EcoRV — Smal fragment 
in the MscI site, in the same orientation as psbH (p41A). 
The mutant psbH 5 f UTR fragments were generated with 
the antisense primer Sta-as and the respective sense primers 
ApaS, ComS and DelS. These were cloned in pCRII-Topo 
(Invitrogen), verified by sequencing, excised with Pad and 
Afel and inserted into p41A digested with the same 
enzymes to obtain the mutant transformation vectors. 

For psbB transformation, the C. reinhardtii recipient 
hosts were obtained as follows. A vector (pANcoA3) 
was constructed by replacing the NcoI-SacII fragment 
of p38ANco (19) with the recyclable aadA cassette (44) 
excised with SacI and Kpnl from pKS483<2<2^L4483. This 
vector was used to transform C. reinhardtii (WT or mbbl- 
222 E), and homoplasmic strains were obtained by 
repeated selection on spectinomycin. Further growth in 
absence of spectinomycin allowed intrachain recombin- 
ation between the 483-bp repeats (44) and obtention 
of homoplasmic psbB 5' UTR deletion strains (non- 
photosynthetic, spectinomycin-sensitive), which were 
verified by Southern blotting. One such deletion strain 
(A 3 ) was used as a host for transformations with the 
linker-scan mutations of psbB in vector p38ANco to 
ensure that the desired mutations were incorporated 
together with the selectable marker (Supplementary 
Figure SI). 

For psbH transformation, the C. reinhardtii recipient 
host was obtained as follows. A vector (p41K7R) was 
constructed from p41 by replacing the XmnI — MscI 
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segment with the recyclable aadA cassette (44). This was 
done in two steps: first inserting the cassette (amplified 
from pKS483<2<2^L4483 with phosphorylated Ml 3 and 
Ml 3 reverse primers) in the MscI site, in the same orien- 
tation as psbH; in the second step, the 1.8-kb Apal 
fragment (one site in the plasmid vector, one site at the 
edge of the cassette) was replaced with a 0.7-kb Apal — 
XmnI fragment to obtain p41K7R. This vector was used 
as described above to transform C. reinhardtii and obtain 
homoplasmic psbH deletion strains (non-photosynthetic, 
spectinomycin-sensitive), which were verified by PCR. 
Such a deletion strain (AH) was used as a host for trans- 
formations with the psbH mutations in the p41A vector, 
so as to ensure that the mutations were inserted together 
with the selectable marker, similar to what is described 
earlier and in Supplementary Figure SI for psbB mutants. 

Electrophoretic mobility shift assay 

Protein crude extracts were prepared from a strain ex- 
pressing a version of Mbbl tagged with a triple hemagglu- 
tinin (HA) epitope obtained by rescuing the mbbl-222E 
mutant with the Mbbl gene fused to a triple HA tag at the 
3'-end (31), and as a negative control a strain expressing 
the gene coding HA-tagged sedoheptulose-l,7-bisphos- 
phatase from Dunaliella salina. 

A total of 1 x 10 8 cells in exponential phase were col- 
lected and re-suspended in 1 ml of extraction buffer 
(80 mM NaCl, lOmM Tris HC1, 1 mM ethylenediaminete- 
traacetic acid, 2mM dithiothreitol, 5% glycerol, pH 8.0, 
and 1 x Sigma protease inhibitors). The cells were lysed by 
adding 500 ul of glass beads (diameter 0.3 mm) and vigor- 
ous agitation for 2x1 min. The lysate was centrifuged for 
12min at 13 000g to pellet cell debris. Partial purification 
of Mbbl -HA and Sbp-HA was performed using an anti- 
HA affinity matrix (Roche) according to the recommen- 
dations of the manufacturer, using a synthetic HA-peptide 
(Genescript Inc., USA) for the elution. 

For the preparation of the RNA probes, templates cor- 
responding to the WT psbB 5 r UTR and the m26-31 mutant 
psbB 5' UTR were prepared by PCR using as templates the 
p38ANco and the p38ANcoApa-26 plasmids that were 
used for site-directed mutagenesis. The template was 
amplified with the sense primer T7-35 (which contains 
the T7 promoter) and the corresponding antisense primer 
psbB-rev3-probe. RNAs were in vitro transcribed in the 
presence or absence of 32 P-UTP using T7 RNA polymerase 
(Pr omega). The probes obtained extend from the psbB pro- 
cessing site at —35 to position +64 relative to the AUG. 
The probes were verified by gel electrophoresis and 
quantified by scintillation counting ( 32 P-labeled RNA) or 
absorbance at 260 nm (competitor RNA). 

Binding reactions contained 150mM NaCl, 50 mM 
Tris-HCl (pH 7.5), 4mM dithiothreitol, 0.04mg/mL 
bovine serum albumin, 0.5mg/mL heparin, 15% 
glycerol, 10 U of RNAsin (Promega), 0.5% Triton 
XI 00, 15pM radiolabeled RNA and protein amounts as 
indicated. Reactions were incubated for 20 min at 25°C 
and resolved on 5% native polyacrylamide gels. The 
radioactive signals were detected with a phosphorimager 
(BioRad Molecular Imager FX). 



RESULTS 

Identification of chloroplast sRNAs within 
Chlamydomonas RNA-seq data sets 

Small RNAs such as small interfering RNAs and micro 
RNAs can be discovered by high-throughput sequencing 
of their cDNAs. Such approaches have been carried out 
in a number of autotrophic organisms including 
Chlamydomonas (39). Starting from total cellular RNA, 
short RNAs were separated by gel electrophoresis, 
eluted and used to prepare a template library for high- 
throughput sequencing. These data sets include short 
RNAs derived from all three genomes in the nucleus, the 
chloroplasts and the mitochondria. 

We pooled reads from seven libraries originally 
designed to analyze micro RNAs and small 
interfering RNAs in Chlamydomonas (39) and from the 
Comparative Sequencing of Plant Small RNAs project 
(http://smallrna.udel.edu). For each library, identical 
reads were combined and called sequences. A total of 
17400 sequences were mapped to the chloroplast 
genome. This corresponds to 2.2% of all sequences in 
the seven libraries. Such sequences can represent up to 
hundreds of individual reads. The identified sequences 
are scattered across the entire chloroplast genome with 
distinct local enrichments. Genes known to be highly ex- 
pressed like the rrn operon, or the psbA and rbcL mRNAs 
show the expected higher sequence densities compared 
with the rest of the genome. For further analysis, we 
excluded such extended regions of high sequence density, 
as we were interested in isolated and narrow peaks of 
sequence density — a hallmark of sRNAs (25). We 
focused on non-coding regions as prime targets of RNA- 
stabilizing proteins and disregarded all sequence clusters 
within coding sequences. Furthermore, only clusters of at 
least 3 overlapping sequences, and representing >10 reads, 
were retained. The core of each cluster was defined as 
those nucleotides represented in 50% of all sequences 
within the cluster. A set of 61 of such clusters, which we 
henceforth call sRNAs, was identified (Supplementary 
Table S2). 

The mbbl mutant fails to accumulate two sRNAs from 
its target UTRs 

Angiosperm chloroplast sRNAs in non-coding regions 
show a bias in location toward transcript ends — consistent 
with the idea that they are generated by the protective 
action of helical-repeat proteins. In our data set, we find 
seven sRNAs to map immediately down- or upstream to 
known transcript ends (Supplementary Table S2). We con- 
firmed the ends of four of these mRNAs by rapid ampli- 
fication of cDNAs (RACE, Figure 1A). Two of these 
sRNAs are located in the psbB and psbH 5' UTRs, 
which are the proposed target regions of Mbbl. To test 
whether Mbbl is functionally linked to the accumulation 
of these sRNAs, we probed total RNA from the mbbl null 
mutant by RNA gel blot hybridization (Figure 1C). We 
could detect neither of these two sRNAs in the mutant 
RNA extract, while both sRNAs accumulated in WT 
RNA preparations. The specificity of the loss of psbB 
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Figure 2. Mutational analysis of the psbB 5' UTR. (A) Map of the psbB/psbT/psbH locus. The gray bars represent the open reading frames of psbB, 
psbT and psbH, and below the line of the putative psbN, which would be transcribed from the opposite strand. Restriction sites relevant to this work 
are indicated at the top, and the probes used for RNA blot hybridization are shown below as white bars. (B) The 5' UTRs of the psbB transcripts. 
The major form of the psbB 5' UTR begins at —35 relative to the start codon. A minor form, which is two orders of magnitude less abundant, begins 
at —147 (19). (C) Linker-scan mutagenesis of the psbB 5' UTR. The sequence of the WT is shown at the top with the translation start codon of psbB 
highlighted in bold and the S-box underlined. The corresponding sRNA (psbB 5') is shown in lowercase above the sequence. The black arrow shows 
the positions of the major 5'-end at —35. The sequences of the chloroplast transformation vector p38ANco and of the eight linker-scan mutants are 
aligned below, with the Apal sites in italics and with dots representing unchanged bases. 



and psbH sRNAs in mbbl was confirmed by the presence 
of the psbH J'-antisense sRNA (Figure 1C). It is striking 
that Mbbl is required for the accumulation of the psbB 5 f 
and the psbH 5 f sRNAs, as both correspond to mRNAs 
that depend on the presence of Mbbl for their stable ac- 
cumulation (29). A comparison of the psbB and psbH 5' 
UTRs showed that they share an identical sequence, AAG 
UAAA, at the same positions relative to their respective 
processing sites (Figure IB). This sequence is at the core of 
both sRNAs, and will be referred to as 'the S-box'. In the 
following, we analyzed the functional significance of both 
S-box sequence elements by mutational analyses in vivo 
using chloroplast transformation. 

A c/s-acting element required for psbB mRNA 
accumulation 

To systematically investigate the function of c/s-acting 
elements in the psbB 5 f UTR, and in particular the short 
form beginning at —35, which had not been previously 
investigated, we designed a series of mutations. In a se- 
quential manner, six bases of the sequence were replaced 
by GGGCCC, the recognition site for the restriction 
enzyme Apal, a strategy dubbed linker-scan mutagenesis 
(Figure 2). The successive mutations spanned the region 
—3 to —48 relative to the translation start codon, thus 
extending 13 nt upstream of —35, the 5'-end of the 



major form of the psbB 5' UTR. Some of the mutations 
overlapped so that every base in the sequence had been 
mutated in at least one of the constructs. To facilitate the 
recovery of homoplasmic mutant lines, we first generated 
a recipient strain that carried a substitution upstream 
of the ATG of psbB in the chloroplast genome 
(Supplementary Figure SI). This ensured that all 
transformants with the aadA insertion also carried the 
desired mutations and excluded the possibility that WT 
psbB 5 r UTR sequences be retained in the mutant lines 
because of homologous recombination between the select- 
able marker and the mutation. The mutated vectors were 
used for biolistic transformation of the chloroplast 
genome and selection on spectinomycin-containing 
media. After several rounds of sub-culturing on spectino- 
mycin, the selected lines were homoplasmic and harbored 
the expected mutations as shown by PCR amplification 
followed by Apal digestion (Supplementary Figure SIB). 

To investigate the effect of the mutations on mRNA 
accumulation, we analyzed the transcripts of the psbB/T/ 
H cluster. RNA blots were hybridized with a probe 
specific for psbB to reveal the monocistronic form as 
well as the dicistronic transcript psbB/T (Figure 3A). In 
the WT MBB1 background, the most dramatic reduction 
of the psbB mRNA and of the dicistronic psbB /T mRNA 
was with the mutation between positions —26 and —31 
(m26-31), for which the mRNAs were not detectable. 
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Figure 3. RNA analysis of the psbB 5' UTR mutants. (A) RNA blot hybridization analysis of the psbB/T/H transcripts. Total RNA was analyzed by 
agarose gel electrophoresis and blot hybridization using the probes indicated to the left of each panel. A probe for exon 3 of psaA was used as a 
control. (B) Primer extension analysis of the psbB RNAs. Total RNA was used as a template for reverse transcription using oligonucleotide primers 
specific for psbB, psbD or both as indicated at the top of the panel. The psbD primer serves as an internal control. Owing to the abundance of psbD 
mRNA, background precludes the detection of the longer form of psbB RNA starting at —147, which is two orders of magnitude less abundant than 
the form at —35 (19). The panel on the right shows sequencing reactions of the psbB 5' UTR with the psbB primer. 



For the flanking mutations m21-26 and m32-37, the levels 
were approximately one-third of the WT (Figure 3 A). In 
the other mutants, the amounts of these mRNAs were in 
the WT range. It is noteworthy that the psbB 5 r sRNA 
coincides with the region where mutants display reduced 
RNA accumulation. 

The RNA blots were then hybridized with a probe for 
psbH, which detected two transcripts that differ by the 
length of their 3' UTR (30). In the WT background, the 
psbH transcripts were detectable at normal levels in all the 
mutants, and their accumulation was even increased in the 
strains where psbB mRNA was less abundant (Figure 3A). 
This suggests that psbB and psbH may compete for a 



common limiting factor that confers RNA stability (see 
later). 

Mutant psbB transcripts are accurately processed at —35 

In certain mutants, a normal pattern of psbB RNAs was 
present as determined by RNA blotting, but the possibility 
remained that in some mutants, the 5'-end of the tran- 
scripts could be changed to a different position. To 
address this question, we determined by primer extension 
the end of the 5' UTR of the psbB transcripts (Figure 3B). 
We found that the differences of accumulation of the tran- 
scripts ending at -35 among the mutated strains correlated 
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well with the levels detected by RNA blot analysis for the 
psbB and psbB/T RNAs. Moreover, the length of the 5' 
UTR was not affected by any of the mutations for which 
RNA is present, even for the mutation m32-37, which 
alters the sequence at the position of the major 5'-end 
(—35). This shows that processing does not require the 
recognition of a specific sequence at the exact site corres- 
ponding to the end of the mature product. A similar situ- 
ation was observed for the psbD 5 f UTR, where 
substitutions of sequences around the processing site 
(—47) do not alter the position of processing (20). 
Likewise, substitution of four bases at the processing site 
in the 5' UTR of psbA did not change the position or 
extent of processing (45). 

We also determined the effects of the mutations 
on growth and on the accumulation of the psbB gene prod- 
uct CP47 (Supplementary Figure S2). The mutant strains 
were compared with the vector control (p38ANco), which 
accumulates normal levels of psbB/T mRNAs (19), but 
has an Ncol restriction site engineered at the ATG start 
codon, which reduces translation efficiency and photosyn- 
thetic growth. Phototrophic growth of the strains whose 
mutations are located upstream of the —35 end (m37-42 
and rn43-48) was similar to the vector control (p38ANco). 
The m32-37 mutant, where mRNA levels were strongly 
reduced (Figure 3A), grew photoautotrophically, albeit 
more slowly than the vector control. It was previously 
reported for other chloroplast mRNAs that their abun- 
dance under normal conditions is not limiting for transla- 
tion (46). All mutations within the short leader (strains 
m3-8 to m26-31) further reduced phototrophic growth 
below vector-control levels (p38ANco). The levels of 
CP47 in the mutant strains were consistent with their 
capacity for photosynthetic growth (Supplementary 
Figure S2), suggesting that the mutations within the 
short 5' UTR may affect translation efficiency. 

The psbH 5' UTR contains a copy of the S-box, which is 
required for processing and translation 

To analyze the importance of the sequence element repre- 
sented by the psbH 5 r sRNA for the maturation and sta- 
bility of psbH mRNA in vivo, we mutated its conserved 
core motif, dubbed the S-box. Three different mutations 
of the S-box were introduced in the chloroplast genome 
(Figure 4A), with the S-box replaced by an Apal site 
(ApaS), by the complementary sequence (CompS) or 
deleted (DelS). 

The three S-box mutants failed to grow on minimal 
medium, and immunoblotting showed that they 
accumulated no detectable PsbH or CP47, and only low 
levels of Dl and CP43, other subunits of PSII (Figure 4B), 
as compared with the Rubisco control. This probably re- 
flected a lack of PsbH translation and subsequent proteo- 
lytic degradation of the other core subunits of the PSII 
complex (47). The psbH transcripts were examined by 
RNA blotting (Figure 4C). In the WT, several forms of 
psbH mRNAs were present, which were previously 
determined to differ in the length of their 3' UTRs (30). 
However, in the S-box mutants, these were absent and 
longer forms accumulated. To determine how these 



longer forms differed from the WT, we compared RNA 
blots hybridized with probes for the region upstream of 
the psbH 5 f UTR or for the coding sequence (Figure 4D). 
The upstream probe detected the longer transcripts in an 
S-box mutant (ApaS), but not in the WT. This indicated 
that the longer transcripts in the mutant had 5' extensions 
and was confirmed by mapping the psbH RNAs by primer 
extension (Supplementary Figure S3). In the WT, we 
observed the mature psbH 5'- ends at position —54, but 
we did not find an end at position —78 as previously 
reported (30). In the ApaS S-box mutant, no transcripts 
ending at —54 were detected, but a major band was found 
at —617. In the WT, a band migrated to the same position, 
but was present in lower amounts. The results of primer 
extension were consistent with those of RNA blotting, and 
showed that psbH transcripts with a 5' extension accumu- 
late in the S-box mutants. These data indicate that the 
S-box is important for processing of psbH RNA at —54, 
and/or for the stability of the shorter transcripts. 
Interestingly, the psbB transcripts accumulated to higher 
levels in the psbH S-box mutants (Figure 4C), again sug- 
gesting a competition of these transcripts for a common 
factor. 

Mbbl associates with the psbB and psbH 5' UTRs in vitro 

It has been shown that Mbbl acts through the 5' UTR of 
psbB to allow stable accumulation of the corresponding 
transcript and may also play a direct role in the accumu- 
lation of psbH (19). The identification of a conserved 
sequence present on both the psbB and psbH 5 f UTRs, 
the accumulation of sRNAs encompassing these elements 
and the effect of mutations of these elements on the mat- 
uration of the corresponding transcripts suggest that these 
may be the target sites of Mbbl. To confirm this hypoth- 
esis, we ran electrophoretic mobility shift assays to deter- 
mine whether Mbbl associates with the mRNAs. Because 
our attempts to obtain recombinant Mbbl expressed in 
Escherichia coli in soluble form were not successful, we 
used immunoprecipitation to enrich Mbbl::HA from 
Chlamydomonas extracts. Immunopurified proteins were 
incubated with in vitro transcribed RNAs corresponding 
to the psbB 5' UTR. As shown in Figure 5A, the psbB 5' 
UTR is bound by the Mbbl -HA immune-precipitate but 
not by the Sbp-HA control (sedoheptulose-l,7-bisphos- 
phatase, an enzyme of the Calvin-Benson cycle). The 
assays performed with increasing protein amounts 
resulted in increased shifting of the radiolabeled probe. 
The specificity of RNA-binding was investigated by 
comparing competition of the radiolabeled probe with 
increasing concentration of unlabeled WT probe or a 
mutant probe carrying the m26-31 substitution that 
abolishes psbB stability in vivo (Figure 5B). A 10-fold 
excess of the WT psbB 5' UTR was sufficient to com- 
pletely compete the signal, whereas competition with the 
m26-31 probe required a 100-fold excess. These results 
indicate that Mbbl associates with psbB mRNA, and 
that this interaction involves the sequence element (S- 
box) that is essential for psbB mRNA stability in vivo. 
This association could be direct, but could also involve 
other proteins that are present in the immunoprecipitate. 
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Figure 4. Mutational analysis of the conserved S-box in the psbH 5' UTR. (A) Mutagenesis of the psbH 5' UTR. The sequence of the psbB 5' UTR 
is shown at the top for comparison, and the sequence of the psbH 5' UTR below it, with the translation start codons in bold. The conserved 'S-box' 
is highlighted in bold and underlined. The sequences of the sRNAs are shown in lowercase above the respective sequences. The black arrows show 
the positions of the major 5'-ends (see Figure 2 and Supplementary Figure S3). The transformation vector (p41A) carries a selectable aadA spec- 
tinomycin resistance cassette inserted downstream of psbH (Materials and Methods). The Chlamydomonas host strain (AH) carries a substitution of 
the psbH gene and its 5' UTR to ensure that transformants carrying the aadA marker also bear the desired mutation. (B) Immunoblot analysis of 
PSII components. Total protein extracts were analyzed by sodium dodecyl sulphate-poly acrylamide gel electrophoresis and immunoblotting with the 
antibodies indicated on the left. From left to right, the samples are the WT, the mbbl-222E mutant (mbbl), the AH transformation host (AH), the 
psbB m26-31 mutant (m26-31), the transformation host rescued with a WT psbH vector (p41A) and the three mutants shown in (A). (C) RNA blot 
hybridization analysis of the psbB/T/H transcripts. Total RNA was analyzed by agarose gel electrophoresis and RNA blot hybridization using the 
psbH and psbB probes as indicated on the left. A probe for psaA was used as a control. (D) Mapping of the psbH transcripts. The schematic map 
shows the positions of the 5'-end at —54/— 53 with a black arrow and of the 3' termini with white arrows [according to (30)]. The upstream probe (5') 
and coding sequence probe (cod) are depicted below the map as open bars. Total RNA from the WT and a representative mutant (ApaS) was 
analyzed by agarose gel electrophoresis and RNA blot hybridization using the probes indicated at the bottom of each panel. 



In a parallel set of experiments, we used a radiolabeled 
probe corresponding to the 5'UTR of psbH, and also 
observed increased shifting with increasing amounts of the 
Mbbl -HA immunoprecipitate (Figure 5C). The specificity 
of RNA-binding was tested by comparing competition with 
increasing concentration of unlabeled WT probe or a 
mutant form carrying the ApaS substitution, which affects 
psbH processing in vivo (Figure 5D). The WT probe 
competed much more efficiently than the mutant version. 
These results indicate that Mbbl also associates with psbH 
mRNA and that the interaction requires the S-box that is 
essential for proper RNA processing in vivo. A closely 
related sequence is also found upstream of psbH in higher 



plants (AAGUcAA, —36 to —30), and is bound in vitro by 
recombinant HCF107, the plant ortholog of Mbbl (24). 
Thus, it is likely that Mbbl also interacts directly rather 
than indirectly with its two RNA targets in Chlamydomonas. 



DISCUSSION 

sRNAs help the identification of as-acting elements 
important for RNA processing and stability in the 
Chlamydomonas transcriptome 

We have screened the Chlamydomonas transcriptome for 
sRNAs as a proxy to identify possible target sites of 
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Figure 5. Electrophoretic mobility shift assay with Mbbl-HA immunoprecipitates. (A) Binding of Mbbl-HA to the 5' UTR of psbB. A radiolabeled 
probe corresponding to the mature psbB 5' UTR was incubated with increasing amounts of Mbbl-HA immunoprecipitate (0.1, 1, 2.5 and 4.5 fig) and 
resolved by native polyacrylamide gel electrophoresis. Bound (B) and unbound (U) probes are marked. A similar experiment was performed with 
increasing amounts of Sbp-HA (sedoheptulose 1,7 bisphophatase) as a negative control. (B) Competition of RNA binding with WT or mutant psbB 
5' UTR. One microgram of Mbbl-HA protein extract was incubated with a radiolabeled probe corresponding to the WT psbB 5' UTR and an excess 
(10-, 50- or 100-fold) of an unlabeled competitor corresponding either to the m26-31 psbB 5' UTR (left) or the WT psbB 5' UTR (right). (C) Binding 
of Mbbl-HA to the 5' UTR of psbH. A radiolabeled probe corresponding to the mature psbH 5' UTR was incubated with increasing amounts of 
Mbbl-HA immunoprecipitate as in (A). (D) Competition of RNA binding with WT or mutant psbH 5' UTR. Competition was performed as in 
(B) with WT or mutant (ApaS) psbH 5' UTR unlabeled RNA. 



chloroplast RNA-binding proteins. We found several 
sRNAs that map to UTRs of mRNAs, adjacent to 
known transcript ends (Supplementary Table SI). 
Among RNA-binding proteins, helical-repeat proteins 
are known to generate sRNAs by binding to transcripts 
and protecting their footprints against exonucleolytic deg- 
radation (2). Other RNA-binding proteins may also con- 
tribute to the generation of footprints. For example, we 
found sRNAs that may represent binding sites of Rbp63, 
Nac2 and Mcdl in the psbA, psbD and petD 5' UTRs, 
respectively, as these proteins are known to interact with 
the UTR regions of their target mRNAs (18,48,49). The 
interaction of these proteins with their RNAs awaits fine- 
mapping, possibly aided by sRNA location, similar to 
what we have shown here for Mbbl. It will be interesting 
to investigate whether there are further links between 
sRNAs, previously determined functional c/s-elements 
for RNA processing and RNA-binding proteins. For 
example, there are RNA stabilizing elements in the psbD 
5' UTR, upstream and downstream of the —47 processing 
site (20). Determinants of petD and rbcL RNA stability 
reside in sequences adjacent to the 5'-end of the mature 
mRNA (13,50,51). There are still numerous genes 
encoding helical-repeat proteins with unassigned functions 
and targets in the Chlamydomonas genome (13,14,52), and 
sRNAs could be valuable tools to identify their binding 
sites. In general, the large set of unassigned sRNAs 
identified here could be a potential source for not only 
the identification of novel ds-acting elements, but also a 



source of baits to biochemically fish for the associated 
RNA-binding proteins required for RNA stabilization 
and processing. 

C7s-acting targets of Mbbl required for RNA stabilization 

Previous work had shown that both the psbB and psbH 
RNAs are targets of the nucleus-encoded protein Mbbl, 
and that the psbB RNA is subject to 5' exonucleolytic 
degradation in the absence of Mbbl (19). Our genetic 
analysis identifies a c/s-acting element in the 5' UTR, 
which is essential for the stable accumulation of the 
psbB and psbB/T transcripts. These RNAs were com- 
pletely absent in m26-31, and reduced to about one-third 
of WT levels in the flanking m21-26 and m32-37 mutants. 
The mutations in m21-26 and m26-31 span a sequence of 
seven bases (S-box: AAGUAAA), which is also found 
at position -43 to -49 in the short 5' UTR of psbH 
(Figure 4A) and is present at the core of both sRNAs 
that originate from the two UTRs. Here we show that 
the S-box is essential for proper psbH mRNA maturation 
in vivo. We observe that Mbbl associates with the psbB 
and psbH 5 f UTR in vitro, and that the interaction can be 
competed with the WT RNA 10 times more efficiently 
than with the mutant RNAs carrying a substitution of 
the S-box, indicating that these interactions are dependent 
on the S-box. We also detect in vivo small RNAs centered 
on the S-boxes of both psbB and psbH, whose presence is 
dependent on Mbbl. Taken together, the data indicate 
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that Mbbl associates with the 5' UTRs of psbB and psbH 
through stable interactions with their respective S-boxes 
and that these are important for the maturation and sta- 
bility of the two transcripts. 

It is interesting to note that both Mbbl and its target 
sequence have been conserved during evolution, as 
the ortholog HCF107 binds a sequence in maize and 
Arabidopsis whose 7-nt core (AAGUcAA) is closely 
related to the S-box (AAGUAAA) (24). Assuming a 
mode of RNA-protein interaction where one TPR/HAT 
repeat recognizes one base (53), it is expected that the 
binding site of Mbbl, which harbors 10 repeats (31), is 
longer than the S-box. This is compatible with our obser- 
vation that the levels of psbB RNA are affected in three 
consecutive linker-scan mutants. While the S-box, defined 
as the sequence conserved in psbB and psbH, is at position 
—22 to —28, the loss of psbB mRNA is strongest in the 
mutant affecting —26 to —31, with a weaker effect in the 
mutant affecting —21 to —26. A longer target sequence of 
Mbbl would also be compatible with the length of the 
sRNA that we identified (21 nt). 

In those mutants that still accumulate measurable levels 
of psbB RNA, our primer extension analysis showed that 
the transcripts end at —35, even when the actual sequence 
around that site is altered (m32-37). Processing of the psbA 
and psbD RNAs is also independent of the sequence at the 
processing site (20,45). We have previously presented 
evidence that a 5 -3' exonucleolytic activity efficiently 
degrades the longer —147 transcript both in the WT and 
in mbbl-222E (19). Thus, the most likely model for the 
role of Mbbl in stabilization of psbB RNA is that it stably 
binds the RNA through the S-box and arrests the 5 -3' 
processing activity at position —35, thus creating and pro- 
tecting the mature 5'-end, possibly in association with 
other accessory factors. Evidence for a similar mechanism 
has previously been presented for pet A and Meal, 
psbD and Nac2, as well as for petD and Mcdl in 
Chlamydomonas (20,21,54), and for several plant chloro- 
plast transcripts, including psbH, which is protected by 
HCF107 at its 5'-end (24). Mbbl and its plant ortholog 
HCF107, as well as Nac2 (which protects psbD in 
Chlamydomonas), belong to the TPR family of helical 
repeat proteins, and more specifically to its HAT variant 
(15). There is strong evidence that members of the PPR 
family act in a similar manner by stably binding to specific 
chloroplast transcripts and protecting them from 
exonucleolytic degradation (22,23,55). In some cases, it 
is remarkable that by binding in the intergenic region of 
a polycistronic transcription unit, the same PPR protein 
can protect both the upstream transcript from 3' degrad- 
ation and the downstream transcript from 5' degradation 
(22,56). 

Different functions of Mbbl for its two target mRNAs 

When the S-box was mutated in the psbH leader, accumu- 
lation of the respective transcripts was also affected: the 
mature forms of the mRNA were missing, and longer 
transcripts with a 5' extension accumulated (Figure 4). 
This was not the case with the mutants of the psbB 5 r 
UTR for which we did not observe an over-accumulation 



of the longer precursor (Figure 3). Apparently, the role of 
Mbbl in allowing the proper maturation of the psbH 
mRNA is somewhat different than with psbB. To 
explain the accumulation of the long forms of the tran- 
scripts in the S-box mutants of psbH, one possibility is 
that association of Mbbl with the S-box contributes to 
the recruitment of an endonuclease. This could occur 
through protein-protein interaction, or by a change in 
the RNA structure on Mbbl binding that would expose 
a susceptible site upstream of the Mbbl binding site. Thus, 
Mbbl would play a dual role in recruiting an endonucle- 
ase and in protecting the mature psbH RNA against 
exonucleolytic degradation. In higher plants, it was sug- 
gested that RNAse J could account for both endo- and 5' 
exo-nuclease processing activities (22,57,58). 

We observed in the psbB S-box mutants, where the 
amounts of psbB and psbB/T RNAs are strongly 
reduced, that the levels of the psbH RNAs are enhanced 
compared with the WT. Conversely, in the mutants of the 
S-box in psbH, the levels of psbB transcripts are elevated. 
A possible interpretation of these results could be that the 
psbB and psbH transcripts compete for a common factor, 
possibly Mbbl, that would enhance the stability of the 
RNAs and be present in limiting amounts. Experiments 
with transgenic tobacco chloroplasts have offered 
evidence for limiting trans-acting factors involved in 
RNA maturation. Over-expression of a chimeric marker 
with the 5' UTR of clpP led to reduced processing (59). 
Likewise, ectopic over-expression of an RNA editing sub- 
strate resulted in reduced editing of the endogenous tran- 
script (60,61). However, our attempts to introduce by 
chloroplast transformation an additional ectopic copy of 
the Mbbl binding site did not result in a measurable 
decrease in the levels of psbB RNA. 

In conclusion, we have identified a conserved c/s-acting 
element that is essential for processing of the chloroplast 
transcripts psbB/T and psbH and is a target of the 
RNA-binding protein Mbbl. Chloroplast reverse 
genetics corroborated the biological significance of the 
sRNAs that were mapped to the 5'-end of these 
transcripts. 
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